Web robot detection in the scholarly information environment

Authors

  • Paul Huntington
  • David Nicholas
  • Hamid R. Jamali M.
Abstract

An increasing number of robots harvest information on the world wide web for a wide variety of purposes. Protocols developed at the inception of the web laid out voluntary procedures in order to identify robot behaviour, and exclude it if necessary. Few robots now follow this protocol and it is now increasingly difficult to filter for this activity in reports of on-site activity. This paper seeks to demonstrate the issues involved in identifying robots and assessing their impact on usage in regard to a project which sought to establish the relative usage patterns of open access and non-open access articles in the Oxford University Press published journal Glycobiology, which offers in a single issue articles in both forms. A number of methods for identifying robots are compared and together these methods found that 40% of the raw logs of this journal could be attributed to robots.
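As a rough illustration of the kind of log filtering the abstract refers to, the sketch below flags clients that either request robots.txt or announce themselves in the user-agent string, then estimates the share of log records they account for. It is not the method used in the paper: the record layout `(ip, path, user_agent)`, the keyword pattern, and the function names `flag_robot_clients` and `robot_share` are all assumptions made for this example.

```python
import re

# Hypothetical keyword list; real robot user agents vary widely and
# many robots do not identify themselves at all.
ROBOT_UA_PATTERN = re.compile(r"bot|crawler|spider|slurp", re.IGNORECASE)


def flag_robot_clients(records):
    """Return the set of client IPs that look like robots.

    Each record is an (ip, path, user_agent) tuple. This layout is an
    assumption for the sketch, not the log format used in the study.
    """
    robots = set()
    for ip, path, user_agent in records:
        # Heuristic 1: the client fetched robots.txt, as the voluntary
        # exclusion protocol expects well-behaved robots to do.
        if path == "/robots.txt":
            robots.add(ip)
        # Heuristic 2: the user agent declares itself as a robot.
        elif ROBOT_UA_PATTERN.search(user_agent):
            robots.add(ip)
    return robots


def robot_share(records):
    """Fraction of all log records that come from flagged clients."""
    if not records:
        return 0.0
    robots = flag_robot_clients(records)
    hits = sum(1 for ip, _, _ in records if ip in robots)
    return hits / len(records)
```

Both heuristics rely on robots cooperating, which the abstract notes is increasingly rare, so a filter like this would likely undercount robot traffic and would need to be combined with other methods.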

Similar resources

A Navigation System for Autonomous Robot Operating in Unknown and Dynamic Environment: Escaping Algorithm

In this study, the problem of navigation in dynamic and unknown environments is investigated, and a navigation method based on a force-field approach is suggested. It is assumed that the robot performs navigation in...

Motion detection by a moving observer using Kalman filter and neural network in soccer robot

In many autonomous mobile applications, robots must be capable of analyzing the motion of moving objects in their environment. During movement of the robot, image quality is degraded by camera shake, which causes high errors in image-processing outputs. In this paper, we propose a novel method to effectively overcome this problem using Neural Networks and Kalman Filtering theory. This technique u...

Development of RadRob15, A Robot for Detecting Radioactive Contamination in Nuclear Medicine Departments

Accidental or intentional release of radioactive materials into the living or working environment may cause radioactive contamination. In nuclear medicine departments, radioactive contamination is usually due to radionuclides which emit high energy gamma photons and particles. These radionuclides have a broad range of energies and penetration capabilities. Rapid detection of radioactive contami...

Mobile robot wall-following control using a behavior-based fuzzy controller in unknown environments

This paper addresses a behavior-based fuzzy controller (BFC) for mobile robot wall-following control. The wall-following task is usually used to explore an unknown environment. The proposed BFC consists of three sub-fuzzy controllers, including Straight-based Fuzzy Controller (SFC), Left-based Fuzzy Controller (LFC), and Right-based Fuzzy Controller (RFC). The proposed wall-...

A density based clustering approach to distinguish between web robot and human requests to a web server

Today, the world's dependence on the Internet and the emergence of Web 2.0 applications are significantly increasing the need for web robots to crawl sites in support of services and technologies. Despite their advantages, robots may consume bandwidth and reduce the performance of web servers. Despite a variety of research, there is no accurate method for classifying huge data ...

Journal title:
  • J. Information Science

Volume 34

Publication date 2008